Search CORE

198 research outputs found

Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing

Author: Exel Miriam
Huck Matthias
Koneru Sai
Niehues Jan
Publication venue
Publication date: 23/10/2023
Field of study

Large Language Models (LLM's) have demonstrated considerable success in various Natural Language Processing tasks, but they have yet to attain state-of-the-art performance in Neural Machine Translation (NMT). Nevertheless, their significant performance in tasks demanding a broad understanding and contextual processing shows their potential for translation. To exploit these abilities, we investigate using LLM's for MT and explore recent parameter-efficient fine-tuning techniques. Surprisingly, our initial experiments find that fine-tuning for translation purposes even led to performance degradation. To overcome this, we propose an alternative approach: adapting LLM's as Automatic Post-Editors (APE) rather than direct translators. Building on the LLM's exceptional ability to process and generate lengthy sequences, we also propose extending our approach to document-level translation. We show that leveraging Low-Rank-Adapter fine-tuning for APE can yield significant improvements across both sentence and document-level metrics while generalizing to out-of-domain data. Most notably, we achieve a state-of-the-art accuracy rate of 89\% on the ContraPro test set, which specifically assesses the model's ability to resolve pronoun ambiguities when translating from English to German. Lastly, we investigate a practical scenario involving manual post-editing for document-level translation, where reference context is made available. Here, we demonstrate that leveraging human corrections can significantly reduce the number of edits required for subsequent translations\footnote{Interactive Demo for integrating manual feedback can be found \href{https://huggingface.co/spaces/skoneru/contextual_refinement_ende}{here}

arXiv.org e-Print Archive

Analyzing Challenges in Neural Machine Translation for Software Localization

Author: Exel Miriam
Huck Matthias
Koneru Sai
Niehues Jan
Publication venue: Association for Computational Linguistics
Publication date: 30/06/2023
Field of study

KITopen

Discrimination, narratives and family history: an experiment with Jordanian host and Syrian refugee children

Author: Barron Kai
Harmgart Heike
Huck Steffen
Schneider Sebastian
Sutter Matthias
Publication venue: 'Botanic Garden & Botanical Museum Berlin-Dahlem BGBM'
Publication date: 01/01/2020
Field of study

We measure the prevalence of discrimination between Jordanian host and Syrian refugee children attending school in Jordan. Using a simple sharing experiment, we find only little discrimination. Among the Jordanian children, however, we see that those who descended from Palestinian refugees do not discriminate at all, suggesting that a family history of refugee status can generate solidarity with new refugees. We also find that parents' narratives about the refugee crisis are correlated with the degree of discrimination, suggesting that discriminatory preferences are being transmitted through parental attitudes

SSOAR - Social Science Open Access Repository

The Edinburgh Machine Translation Systems for IWSLT 2015

Author: Birch-Mayne Alexandra
Huck Matthias
Publication venue
Publication date: 01/12/2015
Field of study

Edinburgh Research Explorer

Mixed-Domain vs. Multi-Domain Statistical Machine Translation

Author: Birch Alexandra
Haddow Barry
Huck Matthias
Publication venue
Publication date: 01/11/2015
Field of study

Edinburgh Research Explorer

The Edinburgh/LMU Hierarchical Machine Translation System for WMT 2016

Author: Fraser Alexander
Haddow Barry
Huck Matthias
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 12/08/2016
Field of study

Edinburgh Research Explorer

Preference Grammars and Soft Syntactic Constraints for GHKM Syntax-based Statistical Machine Translation

Author: Hoang Hieu
Huck Matthias
Koehn Philipp
Publication venue
Publication date: 01/01/2014
Field of study

Crossref

Edinburgh Research Explorer

Edinburgh SLT and MT System Description for the IWSLT 2014 Evaluation

Author: Birch Alexandra
Bogoychev Nikolay
Durrani Nadir
Huck Matthias
Koehn Philipp
Publication venue
Publication date: 01/12/2014
Field of study

Edinburgh Research Explorer

Edinburgh's Statistical Machine Translation Systems for WMT16

Author: Bojar Ondrej
Haddow Barry
Huck Matthias
Nadejde Maria
Sennrich Rico
Williams Philip
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2016
Field of study

This paper describes the University of Edinburgh’s phrase-based and syntax-based submissions to the shared translation tasks of the ACL 2016 First Conference on Machine Translation (WMT16). We submitted five phrase-based and five syntaxbased systems for the news task, plus one phrase-based system for the biomedical task

Crossref

Edinburgh Research Explorer

Publikationsserver der RWTH Aachen University

Biblio at Institute of Formal and Applied Linguistics